H. grisea CBH1.1

Figure 1: Total Genomic Sequence (1638 nucleotides):

   1 ATGCGTACCG CCAAGTTCGC CACCCTCGCC GCCCTTGTGG CCTCGGCCGC 50
  51 CGCCCAGCAG GCGTGCAGTC TCACCACCGA GAGGCACCCT TCCCTCTCTT 100
 101 GGAAGAAGTG CACCGCCGGC GGCCAGTGCC AGACCGTCCA GGCTTCCATC 150
 151 ACTCTCGACT CCAACTGGCG CTGGACTCAC CAGGTGTCTG GCTCCACCAA 200
 201 CTGCTACACG GGCAACAAGT GGGATACTAG CATCTGCACT GATGCCAAGT 250
 251 CGTGCGCTCA GAACTGCTGC GTCGATGGTG CCGACTACAC CAGCACCTAT 300
 301 GGCATCACCA CCAACGGTGA TTCCCTGAGC CTCAAGTTCG TCACCAAGGG 350
 351 CCAGCACTCG ACCAACGTCG GCTCGCGTAC CTACCTGATG GACGGCGAGG 400
 401 ACAAGTATCA GAGTACGTTC TATCTTCAGC CTTCTCGCGC CTTGAATCCT 450
 451 GGCTAACGTT TACACTTCAC AGCCTTCGAG CTCCTCGGCA ACGAGTTCAC 500
 501 CTTCGATGTC GATGTCTCCA ACATCGGCTG CGGTCTCAAC GGCGCCCTGT 550
 551 ACTTCGTCTC CATGGACGCC GATGGTGGTC TCAGCCGCTA TCCTGGCAAC 600
 601 AAGGCTGGTG CCAAGTACGG TACCGGCTAC TGCGATGCTC AGTGCCCCCG 650
 651 TGACATCAAG TTCATCAACG GCGAGGCCAA CATTGAGGGC TGGACCGGCT 700
 701 CCACCAACGA CCCCAACGCC GGCGCGGGCC GCTATGGTAC CTGCTGCTCT 750
 751 GAGATGGATA TCTGGGAAGC CAACAACATG GCTACTGCCT TCACTCCTCA 800
 801 CCCTTGCACC ATCATTGGCC AGAGCCGCTG CGAGGGCGAC TCGTGCGGTG 850
 851 GCACCTACAG CAACGAGCGC TACGCCGGCG TCTGCGACCC CGATGGCTGC 900
 901 GACTTCAACT CGTACCGCCA GGGCAACAAG ACCTTCTACG GCAAGGGCAT 950
 951 GACCGTCGAC ACCACCAAGA AGATCACTGT CGTCACCCAG TTCCTCAAGG 1000
1001 ATGCCAACGG CGATCTCGGC GAGATCAAGC GCTTCTACGT CCAGGATGGC 1050
1051 AAGATCATCC CCAACTCCGA GTCCACCATC CCCGGCGTCG AGGGCAATTC 1100
1101 CATCACCCAG GACTGGTGCG ACCGCCAGAA GGTTGCCTTT GGCGACATTG 1150
1151 ACGACTTCAA CCGCAAGGGC GGCATGAAGC AGATGGGCAA GGCCCTCGCC 1200
1201 GGCCCCATGG TCCTGGTCAT GTCCATCTGG GATGACCACG CCTCCAACAT 1250
1251 GCTCTGGCTC GACTCGACCT TCCCTGTCGA TGCCGCTGGC AAGCCCGGCG 1300
1301 CCGAGCGCGG TGCCTGCCCG ACCACCTCGG GTGTCCCTGC TGAGGTTGAG 1350
1351 GCCGAGGCCC CCAACAGCAA CGTCGTCTTC TCCAACATCC GCTTCGGCCC 1400
1401 CATCGGCTCG ACCGTTGCTG GTCTCCCCGG CGCGGGCAAC GGCGGCAACA 1450
1451 ACGGCGGCAA CCCCCCGCCC CCCACCACCA CCACCTCCTC GGCTCCGGCC 1500
1501 ACCACCACCA CCGCCAGCGC TGGCCCCAAG GCTGGCCGCT GGCAGCAGTG 1550
1551 CGGCGGCATC GGCTTCACTG GCCCGACCCA GTGCGAGGAG CCCTACACTT 1600
1601 GCACCAAGCT CAACGACTGG TACTCTCAGT GCCTGTAA
Figure 2: Deleting the putative intron sequence (GTACGTT...CAG, nucleotides 413-472) gives the cDNA sequence (1578 nucleotides):

   1 ATGCGTACCG CCAAGTTCGC CACCCTCGCC GCCCTTGTGG CCTCGGCCGC 50
  51 CGCCCAGCAG GCGTGCAGTC TCACCACCGA GAGGCACCCT TCCCTCTCTT 100
 101 GGAAGAAGTG CACCGCCGGC GGCCAGTGCC AGACCGTCCA GGCTTCCATC 150
 151 ACTCTCGACT CCAACTGGCG CTGGACTCAC CAGGTGTCTG GCTCCACCAA 200
 201 CTGCTACACG GGCAACAAGT GGGATACTAG CATCTGCACT GATGCCAAGT 250
 251 CGTGCGCTCA GAACTGCTGC GTCGATGGTG CCGACTACAC CAGCACCTAT 300
 301 GGCATCACCA CCAACGGTGA TTCCCTGAGC CTCAAGTTCG TCACCAAGGG 350
 351 CCAGCACTCG ACCAACGTCG GCTCGCGTAC CTACCTGATG GACGGCGAGG 400
 401 ACAAGTATCA GACCTTCGAG CTCCTCGGCA ACGAGTTCAC CTTCGATGTC 450
 451 GATGTCTCCA ACATCGGCTG CGGTCTCAAC GGCGCCCTGT ACTTCGTCTC 500
 501 CATGGACGCC GATGGTGGTC TCAGCCGCTA TCCTGGCAAC AAGGCTGGTG 550
 551 CCAAGTACGG TACCGGCTAC TGCGATGCTC AGTGCCCCCG TGACATCAAG 600
 601 TTCATCAACG GCGAGGCCAA CATTGAGGGC TGGACCGGCT CCACCAACGA 650
 651 CCCCAACGCC GGCGCGGGCC GCTATGGTAC CTGCTGCTCT GAGATGGATA 700
 701 TCTGGGAAGC CAACAACATG GCTACTGCCT TCACTCCTCA CCCTTGCACC 750
 751 ATCATTGGCC AGAGCCGCTG CGAGGGCGAC TCGTGCGGTG GCACCTACAG 800
 801 CAACGAGCGC TACGCCGGCG TCTGCGACCC CGATGGCTGC GACTTCAACT 850
 851 CGTACCGCCA GGGCAACAAG ACCTTCTACG GCAAGGGCAT GACCGTCGAC 900
 901 ACCACCAAGA AGATCACTGT CGTCACCCAG TTCCTCAAGG ATGCCAACGG 950
 951 CGATCTCGGC GAGATCAAGC GCTTCTACGT CCAGGATGGC AAGATCATCC 1000
1001 CCAACTCCGA GTCCACCATC CCCGGCGTCG AGGGCAATTC CATCACCCAG 1050
1051 GACTGGTGCG ACCGCCAGAA GGTTGCCTTT GGCGACATTG ACGACTTCAA 1100
1101 CCGCAAGGGC GGCATGAAGC AGATGGGCAA GGCCCTCGCC GGCCCCATGG 1150
1151 TCCTGGTCAT GTCCATCTGG GATGACCACG CCTCCAACAT GCTCTGGCTC 1200
1201 GACTCGACCT TCCCTGTCGA TGCCGCTGGC AAGCCCGGCG CCGAGCGCGG 1250
1251 TGCCTGCCCG ACCACCTCGG GTGTCCCTGC TGAGGTTGAG GCCGAGGCCC 1300
1301 CCAACAGCAA CGTCGTCTTC TCCAACATCC GCTTCGGCCC CATCGGCTCG 1350
1351 ACCGTTGCTG GTCTCCCCGG CGCGGGCAAC GGCGGCAACA ACGGCGGCAA 1400
1401 CCCCCCGCCC CCCACCACCA CCACCTCCTC GGCTCCGGCC ACCACCACCA 1450
1451 CCGCCAGCGC TGGCCCCAAG GCTGGCCGCT GGCAGCAGTG CGGCGGCATC 1500
1501 GGCTTCACTG GCCCGACCCA GTGCGAGGAG CCCTACACTT GCACCAAGCT 1550
1551 CAACGACTGG TACTCTCAGT GCCTGTAA 1578
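
The intron deletion described for Figure 2 is plain coordinate arithmetic, which the following Python sketch illustrates. It is a minimal sketch under stated assumptions, not the applicants' procedure: the helper name `excise` is hypothetical, and the 100-nucleotide window is positions 401-500 of the genomic sequence in Figure 1. Removing positions 413-472 (1-based, inclusive) from that window should leave the 40 nucleotides shown at positions 401-440 of the cDNA.

```python
def excise(seq: str, start: int, end: int, offset: int = 1) -> str:
    """Remove positions start..end (1-based, inclusive) from seq.

    offset is the coordinate of seq[0], so a window cut out of a longer
    sequence can still be addressed with full-length coordinates.
    """
    i = start - offset          # 0-based index of first removed base
    j = end - offset + 1        # 0-based index just past the last removed base
    if not (0 <= i <= j <= len(seq)):
        raise ValueError("interval lies outside the supplied sequence")
    return seq[:i] + seq[j:]


# Positions 401-500 of the genomic sequence (Figure 1).
genomic_401_500 = (
    "ACAAGTATCA" "GAGTACGTTC" "TATCTTCAGC" "CTTCTCGCGC" "CTTGAATCCT"
    "GGCTAACGTT" "TACACTTCAC" "AGCCTTCGAG" "CTCCTCGGCA" "ACGAGTTCAC"
)

spliced = excise(genomic_401_500, 413, 472, offset=401)
intron_length = 472 - 413 + 1

print(spliced)               # compare with cDNA positions 401-440
print(1638 - intron_length)  # expected cDNA length
```

The same helper applied to the full 1638-nucleotide genomic sequence would be expected to give the 1578-nucleotide cDNA, since the intron spans 60 nucleotides.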
Figure 3: Translation of the cDNA sequence gives the H. grisea var. thermoidea CBH1 precursor (i.e., with signal sequence) protein sequence (525 amino acids):

   1 MRTAKFATLA ALVASAAAQQ ACSLTTERHP SLSWKKCTAG GQCQTVQASI 50
  51 TLDSNWRWTH QVSGSTNCYT GNKWDTSICT DAKSCAQNCC VDGADYTSTY 100
 101 GITTNGDSLS LKFVTKGQHS TNVGSRTYLM DGEDKYQTFE LLGNEFTFDV 150
 151 DVSNIGCGLN GALYFVSMDA DGGLSRYPGN KAGAKYGTGY CDAQCPRDIK 200
 201 FINGEANIEG WTGSTNDPNA GAGRYGTCCS EMDIWEANNM ATAFTPHPCT 250
 251 IIGQSRCEGD SCGGTYSNER YAGVCDPDGC DFNSYRQGNK TFYGKGMTVD 300
 301 TTKKITVVTQ FLKDANGDLG EIKRFYVQDG KIIPNSESTI PGVEGNSITQ 350
 351 DWCDRQKVAF GDIDDFNRKG GMKQMGKALA GPMVLVMSIW DDHASNMLWL 400
 401 DSTFPVDAAG KPGAERGACP TTSGVPAEVE AEAPNSNVVF SNIRFGPIGS 450
 451 TVAGLPGAGN GGNNGGNPPP PTTTTSSAPA TTTTASAGPK AGRWQQCGGI 500
 501 GFTGPTQCEE PYTCTKLNDW YSQCL 525
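
The relationship between the cDNA of Figure 2 and the precursor protein of Figure 3 can be checked with a short translation routine. The sketch below assumes the standard genetic code (the figures do not name a translation table) and uses a hypothetical helper `translate`. It translates only the first 60 and last 18 nucleotides of the cDNA rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate dna in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":           # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nucleotides of the cDNA (Figure 2, positions 1-60).
cdna_start = (
    "ATGCGTACCG" "CCAAGTTCGC" "CACCCTCGCC"
    "GCCCTTGTGG" "CCTCGGCCGC" "CGCCCAGCAG"
)
# Last 18 nucleotides of the cDNA (Figure 2, positions 1561-1578).
cdna_end = "TACTCTCAGTGCCTGTAA"

n_term = translate(cdna_start)
c_term = translate(cdna_end)
print(n_term)
print(c_term)
```

A 1578-nucleotide reading frame holds 526 codons, which agrees with 525 amino acids plus one stop codon.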






Variant Humicola grisea CBH1.1
Goedegebuur et al.

SN# Unassigned
Docket No. GC794-2
Sheet 4 of 18

Figure 4: Mature protein sequence (i.e., the expressed protein with the putative signal sequence removed) (507 amino acids):

   1 QQACSLTTER HPSLSWKKCT AGGQCQTVQA SITLDSNWRW THQVSGSTNC 50
  51 YTGNKWDTSI CTDAKSCAQN CCVDGADYTS TYGITTNGDS LSLKFVTKGQ 100
 101 HSTNVGSRTY LMDGEDKYQT FELLGNEFTF DVDVSNIGCG LNGALYFVSM 150
 151 DADGGLSRYP GNKAGAKYGT GYCDAQCPRD IKFINGEANI EGWTGSTNDP 200
 201 NAGAGRYGTC CSEMDIWEAN NMATAFTPHP CTIIGQSRCE GDSCGGTYSN 250
 251 ERYAGVCDPD GCDFNSYRQG NKTFYGKGMT VDTTKKITVV TQFLKDANGD 300
 301 LGEIKRFYVQ DGKIIPNSES TIPGVEGNSI TQDWCDRQKV AFGDIDDFNR 350
 351 KGGMKQMGKA LAGPMVLVMS IWDDHASNML WLDSTFPVDA AGKPGAERGA 400
 401 CPTTSGVPAE VEAEAPNSNV VFSNIRFGPI GSTVAGLPGA GNGGNNGGNP 450
 451 PPPTTTTSSA PATTTTASAG PKAGRWQQCG GIGFTGPTQC EEPYTCTKLN 500
 501 DWYSQCL 507
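
The step from the precursor (Figure 3) to the mature protein (Figure 4) amounts to dropping the N-terminal signal sequence. The sketch below is illustrative only. The signal length of 18 residues is not stated in the figures; it is inferred here from the two reported lengths (525 and 507), and the variable names are hypothetical. Only the first 50 residues of the precursor are used.

```python
PRECURSOR_LENGTH = 525   # Figure 3
MATURE_LENGTH = 507      # Figure 4

# Inferred, not stated in the figures: residues removed from the N-terminus.
signal_length = PRECURSOR_LENGTH - MATURE_LENGTH

# Residues 1-50 of the precursor (Figure 3, first row).
precursor_1_50 = "MRTAKFATLAALVASAAAQQACSLTTERHPSLSWKKCTAGGQCQTVQASI"

signal_peptide = precursor_1_50[:signal_length]
mature_start = precursor_1_50[signal_length:]

print(signal_length)
print(signal_peptide)
print(mature_start)      # compare with the start of the Figure 4 sequence
```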
Figure 6: pRAX1 (plasmid map; labels: gla promoter, gla terminator)
Figure 7: Destination vector pRAXdes2 for expression in A. niger
Figure 8: Replicative expression vector pRAXdesCBH1 for CBH1 genes under the control of the
glucoamylase promoter.
Figure 9: CBHI Conversion Assay:
38°C, 700 rpm agitation, 1 day, 12.66% PCS, 15.5 mg enzyme/g cellulose

Figure 10: CBHI Conversion Assay:
65°C, 700 rpm agitation, 1 day, 12.66% PCS, 15.5 mg enzyme/g cellulose

Figure 11: CBHI Conversion Assay:
38°C, 700 rpm agitation, 1 day, 12.66% PCS, 15.5 mg enzyme/g cellulose

Figure 12: CBHI Conversion Assay:
65°C, 700 rpm agitation, 1 day, 12.66% PCS, 15.5 mg enzyme/g cellulose
   1 ATGCGTACCG CCAAGTTCGC CACCCTCGCC GCCCTTGTGG CCTCGGCCGC
  51 CGCCCAGCAG GCGTGCAGCC TCACCACCGA GAGGCACCCT TCCCTCTCCT
 101 GGAAGAAGTG CACCGCCGGC GGCCAGTGCC AGACCGTCCA GGCTTCCATC
 151 ACTCTCGACT CCAACTGGCG CTGGACTCAC CAGGTGTCTG GCTCCACCAA
 201 CTGCTACACG GGCAACGAGT GGGATTCTAG CATCTGCACT GATGCCAAGT
 251 CGTGCGCTCA GAACTGCTGC GTCGATGGTG CTGACTACAC CAGCACCTAT
 301 GGCATCACCA CCAACGGTGA TTCCCTGAGC CTCAAGTTCG TCACCAAGGG
 351 CCAGTACTCG ACCAACGTCG GCTCGCGTAC CTACCTGATG GACGGCGAGG
 401 ACAAGTATCA GAGTAGGTTC TATCTTCAGC CTTCTCGCGC CTTGAATCCT
 451 GGCTAACTTT TACACTTCAC AGCCTTCGAG CTCCTCGGCA ACGAGTTCAC
 501 CTTCGATGTC GATGTCTCCA ACATCGGCTG CGGTCTCAAC GGCGCCCTGT
 551 ACTTCGTCTC CATGGACGCC GATGGTGGTC TCAGCCGCTA TCCTGGCAAC
 601 AAGGCTGGTG CCAAGTACGG TACCGGCTAC TGCGATGCTC AGTGCCCCCG
 651 TGACATCAAG TTCATCAACG GCGAGGCCAA CATTGAGGGC TGGACCGGCT
 701 CCACCAACGA CCCCAACGCC GGCGCGGGCC GCTATGGTAC CTGCTGCTCT
 751 GAGATGGATA TCTGGGAGGC CAACAACATG GCTACTGCCT TCACTCCTCA
 801 CCCTTGCACT ATCATTGGCC AGAGCCGCTG CGAGGGCGAC TCGTGCGGTG
 851 GCACCTACAG CAACGACCGC TACGCCGGCG TCTGCGACCC CGATGGCTGC
 901 GACTTCAACG CGTATCGCCA GGGCAACAAG ACCTTCTACG GCAAGGGCAT
 951 GACCGTCGAC ACCACCAAGA AGCTCACCGT CGTCACCCAG TTCCTCAAGG
1001 ACGCCAACGG CGATCTCGGC GAGATCAAGC GCTTCTACGT CCAGGATGGG
1051 AAGATCATCC CCAACTCCGA GTCCACCATC CCCGGCGTCG AGGGCAACTC
1101 CATCACCCAG GATTGGTGCG ACCGCCAGAA GGTTGCCTTT GGCGACATTG
1151 ACGACTTCAA CCGCAAGGGC GGCATGAAGC AGATGGGCAA GGCCCTCGCC
1201 GGCCCCATGG TCCTGGTCAT GTCCATCTGG GATGACCACG CCTCCAACAT
1251 GCTCTGGCTC GACTCGACCT TCCCTGTCGA TGCCGCTGGC AAGCCCGGCG
1301 CCGAGCGCGG TGCCTGCCCG ACCACCTCGG GTGTCCCTGC TGAGGTTGAG
1351 GCCGAGGCCC CCAACAGCAA CGTCGTCTTC TCCAACATCC GCTTCGGCCC
1401 CATCGGCTCG ACCGTTGCCG GCCTTCCCAG CGATGGCGGC AACAACGGCG
1451 GCAACACCAC CGTCCAGCCC CCGCCCAGCA CCACCACCAC CTCTGCCAGC
1501 AGCAGCACCA CCTCGGCTCC TGCCACCACC ACCACCGCCA GCGCTGGCCC
1551 CAAGGCTGGC CGCTGGCAGC AGTGCGGCGG CATCGGCTTC ACTGGCCCGA
1601 CCCAGTGCGA GGAGCCCTAC ACTTGCACCA AGCTCAACGA CTGGTACTCT
1651 CAGTGCCTGT AA

Figure 14A

Scytalidium thermophilum CBH1 Genomic DNA
   1 ATGCGTACCG CCAAGTTCGC CACCCTCGCC GCCCTTGTGG CCTCGGCCGC
  51 CGCCCAGCAG GCGTGCAGCC TCACCACCGA GAGGCACCCT TCCCTCTCCT
 101 GGAAGAAGTG CACCGCCGGC GGCCAGTGCC AGACCGTCCA GGCTTCCATC
 151 ACTCTCGACT CCAACTGGCG CTGGACTCAC CAGGTGTCTG GCTCCACCAA
 201 CTGCTACACG GGCAACGAGT GGGATTCTAG CATCTGCACT GATGCCAAGT
 251 CGTGCGCTCA GAACTGCTGC GTCGATGGTG CTGACTACAC CAGCACCTAT
 301 GGCATCACCA CCAACGGTGA TTCCCTGAGC CTCAAGTTCG TCACCAAGGG
 351 CCAGTACTCG ACCAACGTCG GCTCGCGTAC CTACCTGATG GACGGCGAGG
 401 ACAAGTATCA GACCTTCGAG CTCCTCGGCA ACGAGTTCAC CTTCGATGTC
 451 GATGTCTCCA ACATCGGCTG CGGTCTCAAC GGCGCCCTGT ACTTCGTCTC
 501 CATGGACGCC GATGGTGGTC TCAGCCGCTA TCCTGGCAAC AAGGCTGGTG
 551 CCAAGTACGG TACCGGCTAC TGCGATGCTC AGTGCCCCCG TGACATCAAG
 601 TTCATCAACG GCGAGGCCAA CATTGAGGGC TGGACCGGCT CCACCAACGA
 651 CCCCAACGCC GGCGCGGGCC GCTATGGTAC CTGCTGCTCT GAGATGGATA
 701 TCTGGGAGGC CAACAACATG GCTACTGCCT TCACTCCTCA CCCTTGCACT
 751 ATCATTGGCC AGAGCCGCTG CGAGGGCGAC TCGTGCGGTG GCACCTACAG
 801 CAACGACCGC TACGCCGGCG TCTGCGACCC CGATGGCTGC GACTTCAACG
 851 CGTATCGCCA GGGCAACAAG ACCTTCTACG GCAAGGGCAT GACCGTCGAC
 901 ACCACCAAGA AGCTCACCGT CGTCACCCAG TTCCTCAAGG ACGCCAACGG
 951 CGATCTCGGC GAGATCAAGC GCTTCTACGT CCAGGATGGG AAGATCATCC
1001 CCAACTCCGA GTCCACCATC CCCGGCGTCG AGGGCAACTC CATCACCCAG
1051 GATTGGTGCG ACCGCCAGAA GGTTGCCTTT GGCGACATTG ACGACTTCAA
1101 CCGCAAGGGC GGCATGAAGC AGATGGGCAA GGCCCTCGCC GGCCCCATGG
1151 TCCTGGTCAT GTCCATCTGG GATGACCACG CCTCCAACAT GCTCTGGCTC
1201 GACTCGACCT TCCCTGTCGA TGCCGCTGGC AAGCCCGGCG CCGAGCGCGG
1251 TGCCTGCCCG ACCACCTCGG GTGTCCCTGC TGAGGTTGAG GCCGAGGCCC
1301 CCAACAGCAA CGTCGTCTTC TCCAACATCC GCTTCGGCCC CATCGGCTCG
1351 ACCGTTGCCG GCCTTCCCAG CGATGGCGGC AACAACGGCG GCAACACCAC
1401 CGTCCAGCCC CCGCCCAGCA CCACCACCAC CTCTGCCAGC AGCAGCACCA
1451 CCTCGGCTCC TGCCACCACC ACCACCGCCA GCGCTGGCCC CAAGGCTGGC
1501 CGCTGGCAGC AGTGCGGCGG CATCGGCTTC ACTGGCCCGA CCCAGTGCGA
1551 GGAGCCCTAC ACTTGCACCA AGCTCAACGA CTGGTACTCT CAGTGCCTGT
1601 AA

Figure 14B

Scytalidium thermophilum CBH1 cDNA
   1 MRTAKFATLAALVASAAAQQACSLTTERHPSLSWKKCTAGGQCQTVQASI 50
  51 TLDSNWRWTHQVSGSTNCYTGNEWDSSICTDAKSCAQNCCVDGADYTSTY 100
 101 GITTNGDSLSLKFVTKGQYSTNVGSRTYLMDGEDKYQTFELLGNEFTFDV 150
 151 DVSNIGCGLNGALYFVSMDADGGLSRYPGNKAGAKYGTGYCDAQCPRDIK 200
 201 FINGEANIEGWTGSTNDPNAGAGRYGTCCSEMDIWEANNMATAFTPHPCT 250
 251 IIGQSRCEGDSCGGTYSNDRYAGVCDPDGCDFNAYRQGNKTFYGKGMTVD 300
 301 TTKKLTVVTQFLKDANGDLGEIKRFYVQDGKIIPNSESTIPGVEGNSITQ 350
 351 DWCDRQKVAFGDIDDFNRKGGMKQMGKALAGPMVLVMSIWDDHASNMLWL 400
 401 DSTFPVDAAGKPGAERGACPTTSGVPAEVEAEAPNSNVVFSNIRFGPIGS 450
 451 TVAGLPSDGGNNGGNTTVQPPPSTTTTSASSSTTSAPATTTTASAGPKAG 500
 501 RWQQCGGIGFTGPTQCEEPYTCTKLNDWYSQCL- 534

Figure 14C

Scytalidium thermophilum CBHI, including signal sequence 